Abstract



The TATA-box Binding Protein (TBP) is required by all three eukaryotic RNA polymerases 
for the initiation of transcription from most promoters. TBP recognizes, binds to, and bends 
promoter sequences called "TATA-boxes" in the DNA. We present results from the study of 
individual Saccharomyces cerevisiae TBPs interacting with single DNA molecules containing 
a TATA-box. Using video microscopy, we observed the Brownian motion of beads tethered 
by short surface-bound DNA. When TBP binds to and bends the DNA, the conformation 
of the DNA changes and the amplitude of Brownian motion of the tethered bead is reduced 
compared to that of unbent DNA. We detected individual binding and dissociation events 
and derived kinetic parameters for the process. Dissociation was induced by increasing 
the salt concentration or by directly pulling on the tethered bead using optical tweezers. 
In addition to the well-defined free and bound classes of Brownian motion, we observed 
another two classes of motion. These extra classes were identified with intermediate states 
on a three-step, linear binding pathway. Biological implications of the intermediate states 
are discussed. 

Key words: tethered-particle-motion; single-molecule; optical tweezers; video microscopy; 
transcription initiation 
INTRODUCTION

Observation of biological systems on the single-molecule level can reveal information that 
is not easily obtainable in ensemble studies. By studying individual molecules it is possible 
to address whether an observed variation in activity is caused by temporal variations in the 
individual molecules or by variation in activity from molecule to molecule (0, 0). Also,
single-molecule techniques often allow the application of a mechanical force to the system
and thus the introduction of a well-defined reaction coordinate (0).

The TATA-box Binding Protein is a small, single-chain, saddle-shaped protein (j^, the 
DNA-binding part of which is highly conserved throughout evolution. The inner, concave 
side of TBP directly contacts the DNA, whereas the outer and evolutionarily more variable
side interacts with various other proteins involved in the regulation of transcription (0). With 
TBP bound, the DNA is bent by ~80° and locally unwound by ~120° (0,11). DNA distortion 
is thought to play a role in transcription initiation, the distortion influencing recruitment 
and stabilization of RNA polymerases and associated proteins. Furthermore, this distortion 
may have a direct mechanical role in chromatin remodeling: Histones have been shown to 
slide along the DNA upon TBP binding (0). Association of TBP with promoter DNA is 
a slow process, but after binding the complex can support multiple rounds of transcription 
initiation (jioi ). 

Most DNA-binding proteins interact with DNA through the major groove, where the 
base-paired sequence is easily accessible. TBP, however, interacts with DNA through the 
minor groove ((3, 0) in a manner that differs also from that of other minor-groove-binding 
proteins di ll). The TBP-DNA interaction is well studied at the ensemble level: Bending 
angles (0,11 111) as well as the specificity of the binding (lillillillillillilQ) to different
DNA sequences under various conditions of pH, temperature, osmolyte, and electrolyte
concentrations have been revealed using X-rays, electron microscopy, gel-retardation, DNase
I foot-printing, fluorescence anisotropy, and Förster Resonance Energy Transfer (FRET).
However, no previously reported studies have addressed this system at the single-molecule 
level. 

TBP is known to bind to several consensus and non-consensus TATA-boxes (jlih . The 
best studied TATA-box is the adenovirus major late promoter (AdMLP), which serves as 
a reference example of TBP-DNA interactions. When binding to the AdMLP, TBP has to 
overcome an activation barrier of nearly 10 kcal/mol, but once bound, the protein resides in 
an energetic minimum that is almost 11 kcal/mol deep £3). A careful analysis of ensemble
data taken at a range of temperatures and protein concentrations led to a prediction of two 
intermediate states on the reaction pathway between the initial, unbound state and the final, 
bound state (17). The structure of the TBP-DNA complex in these intermediate states is
not known, but was proposed (0) to be nearly identical to the final bound form. However, 
direct observation of these intermediates has not previously been reported.

We set up a tethered-particle-motion system to study the interaction between
TBP and DNA at the single-molecule level. A microsphere ("bead") was tethered to a
microscope coverslip by 324 bp of DNA. In the center of the DNA sequence we placed a
TATA-box; the rest of the DNA contained no TATA-like sequences. Using video microscopy
we followed the Brownian motion of the bead and from this motion constructed a measure for 
the conformation of the DNA tether. Upon introduction of TBP, we observed a decrease in 
Brownian motion and interpreted this decrease as the binding of TBP to the DNA. A series of 
control experiments were undertaken to ensure that the observed effect was caused by active 
TBP, binding specifically to the TATA-sequence. We showed that TBP can be forced off the 
TATA-box sequence by increasing the electrolyte concentration or by mechanically pulling on 
the DNA using laser tweezers. The choice of a short DNA-tether and relatively large beads 
ensured that any change in conformation of the DNA was amplified into a larger change 
in the position of the tethered bead. Using this system, we studied the binding kinetics of 
TBP to DNA and showed by direct observation that at least one of the intermediate states 
is less bent than the final state. This constitutes the first direct observation of a structural 
intermediate on the TBP-DNA binding pathway, a finding that may have implications for 
our understanding of the regulation of transcription initiation. 



MATERIALS AND METHODS 



DNA 



DNA-tethers for binding experiments ("TATA-DNA") were engineered to have the AdMLP
sequence 5'-TATAAAAG-3' at position 155. 324-bp double-stranded DNA labeled with
digoxigenin at one end and biotin at the other was produced by 10 rounds of polymerase
chain reaction (PCR) amplification from pSFl (see (21) for details). DNA-tethers without
the TATA-box ("control-DNA") were produced by site-directed mutagenesis using PCR to
have the sequence 5'-TACTCGAG-3', which contains the XhoI recognition sequence (CTCGAG),
at position 155. We
screened the DNA for consensus as well as non-consensus TBP binding sequences known
from the literature (18, 22, 23, 24, 25, 26). In the TATA-DNA we found no high-affinity
sequences other than the AdMLP TATA-box and in the control-DNA we found no high-affinity
sequences.
Beads

The beads used were streptavidin-coated polystyrene spheres of diameter d = 0.46 µm (BangsLabs),
as well as 1.0 µm streptavidin-coated silica spheres (Spherotech). One end of the DNA tether
was specifically attached to the streptavidin-coated microsphere by biotin-streptavidin binding.
The other end was attached to a glass coverslip by digoxigenin/anti-digoxigenin interaction.
Nuclease-free bovine serum albumin (BSA; Roche Applied Sciences) was added to the beads
to a final concentration of 1 mg/ml and incubated for >14 h at 4°C before use, to suppress
sticking to the glass coverslip. Figure 1 shows a schematic drawing of the setup.



TATA-box Binding Protein

Saccharomyces cerevisiae TBP was prepared as described in (jlii |27[ ). The protein was
stored at -80°C, at a concentration of 33.9 µM, in Buffer A (25 mM Hepes-KOH, 2.7 M
glycerol, 1 mM EDTA, 1 mM DTT, pH 7.9) until immediately before use, at which time
it was thawed, diluted in Buffer 1 (10 mM Tris-HCl, 100 mM KCl, 2.5 mM MgCl2, 1 mM
CaCl2, 1 mM DTT, pH 7.4) to the working concentration of 50-678 nM, and kept on ice
between experiments. The stock concentration of TBP was determined from the absorbance
at 280 nm using an extinction coefficient of 12,700 M$^{-1}$cm$^{-1}$ (27). TBP is monomeric under
the conditions and concentrations used in the experiments described here (17. 2



Preparation of Samples 

Microscopy flow cells (~20 µl) were prepared by placing one coverslip No. on top of another,
separated by spacers made from a single thickness of Parafilm (American National Can,
Menasha, WI, USA) coated with silicone vacuum grease (Dow-Corning). Anti-digoxigenin
was attached to the working surface by injecting the flow cell with 20 µg/ml anti-digoxigenin
in PBS and incubating at 4°C for >14 hours. The flow cells were then washed with 3×100 µl
Buffer 1, incubated with DNA for 15 min, and again washed with 3×100 µl Buffer 1. To
suppress non-specific binding of beads and TBP to the surface, the flow cells were incubated
with 2 mg/ml BSA in Buffer 1 for 15 min, saturating all surfaces with BSA. Samples were
mounted on a microscope for observation, and beads were flowed in at high concentrations
(40-160 pM) and allowed to form tethers for 20 min before unbound beads were removed by
washing with 3-5×100 µl Buffer 1.
Video Microscopy and Optical Tweezers

Time-lapse video images for the experiments using silica beads were acquired using a modified
microscope (Leica DM IRB, 100x oil-immersion objective, (jl^)). Bead positions were
determined in each video-frame using a cross-correlation type tracking algorithm written
in MatLab (The MathWorks, MA, USA). The laser tweezers experiments were also
performed on this setup.

To zoom in (temporally) on the binding step, a different microscope was used (Nikon
Eclipse TE300, 60x water immersion objective, (30)). Bead positions were determined by
using a thresholded centroid-tracking method for each of the two de-interlaced video-fields
that comprise a video-frame, thus obtaining a time resolution of 1/50 s.
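The field-by-field tracking described above can be illustrated with a minimal sketch. The text does not give the details of the thresholded centroid method, so the background handling here (subtracting the threshold and clipping at zero) and the function names `split_fields`, `thresholded_centroid`, and `track_frame` are assumptions made for illustration only.

```python
import numpy as np

def split_fields(frame):
    """Return the two interlaced fields of a frame: (even rows, odd rows)."""
    frame = np.asarray(frame, dtype=float)
    return frame[0::2, :], frame[1::2, :]

def thresholded_centroid(field, threshold, first_row=0):
    """Intensity-weighted centroid (x, y) of pixels brighter than `threshold`.

    `first_row` is the full-frame row index of the field's first line (0 or 1),
    so that y is returned in full-frame pixel units.
    """
    # Assumed background handling: subtract the threshold, set the rest to zero.
    weights = np.clip(field - threshold, 0.0, None)
    total = weights.sum()
    if total == 0:
        raise ValueError("no pixel exceeds the threshold")
    rows, cols = np.indices(field.shape)
    x = (weights * cols).sum() / total
    y = (weights * (2 * rows + first_row)).sum() / total
    return x, y

def track_frame(frame, threshold):
    """One position per field, i.e. two positions per interlaced frame."""
    even, odd = split_fields(frame)
    return [thresholded_centroid(even, threshold, first_row=0),
            thresholded_centroid(odd, threshold, first_row=1)]
```

Because each frame yields two positions, a camera delivering 25 interlaced frames per second gives the 1/50 s time resolution quoted above, at the cost of halving the number of image rows available for each position estimate.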

Brownian Motion Measure 

The shortness of the DNA ensured that tethered beads stayed in the focal plane of the
microscope. Motion perpendicular to the surface was not detected. Motion parallel to the
surface, in the $(x, y)$-plane, was detected using one of the two above-mentioned tracking
algorithms. Time-series of $(x, y)$-positions were broken into non-overlapping segments and the
variances, $\sigma_x^2$ and $\sigma_y^2$, were calculated in each segment. As measure for the Brownian motion
of a tethered bead, we chose the root-mean-square-deviation (RMSD), $B = \sqrt{(\sigma_x^2 + \sigma_y^2)/2}$.
Additional measures were applied that allowed us to distinguish between beads tethered by
one, two, or more DNA-molecules (see Appendix A.1 and A.2 and Fig. IHl).
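As a minimal sketch of the measure just defined, the following computes $B$ for each non-overlapping segment of a position time-series. The function name `brownian_rmsd` and the choice to drop a trailing incomplete segment are assumptions; the segment length used in the experiments is not stated here and is left as a parameter.

```python
import numpy as np

def brownian_rmsd(x, y, segment_length):
    """Brownian RMSD B = sqrt((var_x + var_y) / 2) for each full segment."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    if x.shape != y.shape:
        raise ValueError("x and y must have the same length")
    n_segments = x.size // segment_length
    if n_segments == 0:
        raise ValueError("time-series shorter than one segment")
    used = n_segments * segment_length  # a trailing partial segment is dropped
    xs = x[:used].reshape(n_segments, segment_length)
    ys = y[:used].reshape(n_segments, segment_length)
    # Unbiased (n - 1) variances, matching the definitions in Appendix A.1.
    var_x = xs.var(axis=1, ddof=1)
    var_y = ys.var(axis=1, ddof=1)
    return np.sqrt((var_x + var_y) / 2.0)
```

Shorter segments give better time resolution but noisier estimates of $B$, so the segment length sets the trade-off between detecting brief states and separating classes of similar RMSD.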

Establishment of Method 

To investigate the nature of the tether between the bead and the coverslip, and the effect of 
TBP, we did a series of control experiments:

Tethers

A concentration series of DNA showed that the number of tethered beads varied linearly
with the amount of DNA, with few or no beads tethered when no DNA was present and
saturation at high DNA concentrations. Flow cells prepared without anti-digoxigenin led
to a >90% decrease in the number of tethers formed, as did pre-blocking the streptavidin
coated beads with saturating amounts of biotin.

Effect of glycerol

The presence of osmolytes, e.g. glycerol and glucose, has been reported to dramatically
increase both binding affinity and bending angle for TBP binding to DNA (0). Since our
protein was stored in a glycerol-rich buffer, we decided to investigate the effect of glycerol
on our single-molecule experiments. We added 226 nM TBP in 0.91 M glycerol (1/3 Buffer A
and 2/3 Buffer 1) to flow cells with beads tethered by either TATA-DNA or control-DNA.
Essentially all of the tethered beads showed large and instant decreases in Brownian RMSD,
making it impossible to distinguish between beads tethered by TATA-DNA and beads tethered
by control-DNA. To avoid this effect, all experiments reported hereafter were conducted
at 4-18 mM glycerol. This is within the range of intracellular concentrations found in S.
cerevisiae at different growth stages (31), and below the concentration where bend-angles
are expected to be affected (fl^). The effect of osmolytes on the binding to nonspecific versus
target DNA was recently reported in detail for the lac repressor system (|3^).
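The 4-18 mM glycerol range quoted above can be checked against the stock composition given in Materials and Methods. The sketch below assumes that all glycerol in the assay comes from diluting the TBP stock (33.9 µM TBP in 2.7 M glycerol) into glycerol-free Buffer 1; the function name is hypothetical.

```python
STOCK_TBP_NM = 33.9e3        # stock TBP concentration, nM (33.9 uM)
STOCK_GLYCEROL_MM = 2.7e3    # glycerol in Buffer A, mM (2.7 M)

def glycerol_after_dilution(tbp_working_nm):
    """Glycerol (mM) carried over when the stock is diluted to `tbp_working_nm`.

    Assumes the diluent (Buffer 1) contains no glycerol.
    """
    dilution = tbp_working_nm / STOCK_TBP_NM
    return STOCK_GLYCEROL_MM * dilution

for tbp in (50.0, 226.0):
    print(f"{tbp:5.0f} nM TBP -> {glycerol_after_dilution(tbp):.1f} mM glycerol")
```

Under this assumption, working concentrations of 50 nM and 226 nM TBP correspond to roughly 4 mM and 18 mM glycerol, which matches the stated range.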



Protein 

TBP not competent in binding DNA might interact with the bead directly by sticking to 
the glass and the bead simultaneously, or indirectly through entropic effects (jl^. Such 
interaction would influence the motion of the bead and hence also the conformations of the 
DNA. To clarify whether any such effects were present, we conducted experiments with 
heat-inactivated protein: TBP was incubated at 100°C for 5 min, allowed to cool to room
temperature, and then added at a concentration of 226 nM to microscopy samples with 
either TATA-DNA tethers or control-DNA tethers. No change in the Brownian RMSD of 
the tethered beads was observed in samples where inactivated TBP was added. 



Specificity of TBP binding 

To investigate whether the interaction between TBP and DNA was confined to the TATA-box
of the DNA we used the control-DNA described above. Any effect of TBP on the Brownian
motion of beads tethered by control-DNA would be caused by non-specific interactions between
TBP and DNA, since no TATA-box is present in this tether. At low concentrations
of TBP (<100 nM) we detected no change in the Brownian RMSD of beads tethered by
control-DNA. At intermediate concentrations (226 nM) a small fraction of beads tethered
by control-DNA showed brief (~1 min), minor (~10 nm) decreases in Brownian RMSD. At
high concentrations of TBP (678 nM) beads tethered by control-DNA showed abrupt decreases
in Brownian RMSD, often resulting in the bead becoming irreversibly stuck on the
coverslip; this behavior was also observed for beads tethered by TATA-DNA. We interpret
this latter high-concentration behavior as the "collapse" of DNA induced by non-specific
action of the protein, a process also observed in the non-specific interaction of other proteins
with DNA. To collect the data summarized in Fig. IHl we used 226 nM TBP, and for
the data shown in Fig. 3 we worked at 50-100 nM TBP.



RESULTS AND DISCUSSION 

Binding and Dissociation Events 

Figure 2 shows the Brownian RMSD, as a function of time, for a single DNA-tether containing
a TATA-box: Initially (negative times) no TBP was present and $B \approx 75$ nm; 200 s after
addition of TBP, $B$ decreased abruptly to ~50 nm; finally, after exposure to 1 M KCl and
washing with Buffer 1, the Brownian RMSD returned to 75 nm. We interpret this type of
behavior as the specific binding of TBP to the recognition sequence in the middle of the
DNA, concurrent with a decrease in Brownian RMSD (henceforth referred to as a "binding
event"). Addition of a high concentration of salt was followed by an increase in Brownian
RMSD. We interpret this increase as the release of TBP from the TATA-box (henceforth
referred to as a "dissociation event"). The inset in Fig. 2 shows a scatterplot of the positions
visited by the DNA-tethered bead. The recorded positions visited before TBP bends the
DNA-tether are shown as grey points connected by lines. Black dots and lines show the
recorded positions visited after TBP has bound to and bent the DNA-tether. The positions
visited by the bead are isotropically distributed in the $(x, y)$-plane, both before and after
addition of TBP.

The Brownian RMSD, both before and after addition of TBP, varied from one tether to 
the next. In n = 48 observed binding events using DNA-tethered silica beads the Brownian
RMSD decreased from 69 ± 13 nm to 46 ± 12 nm (mean ± SD). That is, the step-size, the
change in Brownian RMSD upon binding by TBP, was 25 ± 9 nm.

When flowing in 1 M KCl we consistently, after a brief waiting time, saw B return to 
the value it had prior to the addition of TBP. We interpret this as the release of individual 
molecules of TBP from single DNA-tethers. Any non-specific, as well as specific, interaction 
keeping the TBP in contact with the DNA under normal conditions is suppressed at this 
concentration of salt because the electrostatic interactions are screened. Thus, once no longer 
bound to the TATA-box, the protein is expected to diffuse away from the DNA-tether. 

Another way to force TBP off DNA is by stretching the tether using an externally applied
mechanical force (36). As a proof of principle, experiments were performed using laser
tweezers to pull on the bead, thereby stretching the DNA and forcing the protein off. These
experiments are described in Appendix A.3. Our observations are in accordance with previously
published results, showing that proteins do not stay as readily attached to stretched
DNA as to relaxed DNA.

Binding kinetics

We investigated the binding kinetics by measuring the time elapsed between the addition of
TBP and the first detected decrease in Brownian RMSD. In Fig. El a histogram of these
waiting times is shown. If the binding of protein to DNA has a constant probability of
success per time interval it is a Poisson process. The waiting times are then exponentially
distributed. A fit of a single exponential to the data returned a time-constant of $\tau = 143$ s. The
corresponding second-order association rate constant is $k_a = [\mathrm{TBP}]^{-1}\tau^{-1} = (3.1 \pm 0.5) \times 10^{4}\,\mathrm{M^{-1}s^{-1}}$
(mean ± SE, n = 45). This value is in rough agreement with values determined
in ensemble experiments performed under similar buffer conditions and temperature
($k_a = 8$-$14 \times 10^{4}\,\mathrm{M^{-1}s^{-1}}$ (17, 25)). The difference, if any, could be caused by a lower than assumed
concentration of binding-competent TBP in our experiments: the presence of surfaces is
expected to diminish the amount of TBP through adsorption and denaturation (39, 40).
Furthermore, the protein is very sensitive to repeated freeze/thaw cycles: We kept these to
a minimum, but the experimental protocol meant that a few cycles of freeze/thaw could not
be avoided. Finally, the proximity of the tethered bead to the coverslip gives rise to a force
on the DNA-tether due to excluded volume effects (Q). This force is expected to suppress
the rate of association by 25-50% (see Appendix A.4).
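The conversion from waiting times to an association rate constant can be sketched as follows. Note two assumptions: the code uses the maximum-likelihood estimate of the exponential time-constant (the sample mean) rather than the histogram fit used in the text, and the usage example takes [TBP] = 226 nM as the concentration for the silica-bead data. The function name `association_rate` is hypothetical.

```python
import numpy as np

def association_rate(waiting_times_s, tbp_molar):
    """Estimate tau, k_a = 1 / ([TBP] * tau), and the standard error of k_a.

    Assumes exponentially distributed waiting times; tau is estimated by the
    sample mean (maximum likelihood), not by fitting a histogram.
    """
    t = np.asarray(waiting_times_s, dtype=float)
    tau = t.mean()
    k_a = 1.0 / (tbp_molar * tau)
    # For an exponential distribution the relative SE of tau is 1/sqrt(n);
    # to first order the same relative error carries over to k_a.
    k_a_se = k_a / np.sqrt(t.size)
    return tau, k_a, k_a_se

# Usage with placeholder data: 45 waiting times whose mean is 143 s.
tau, k_a, k_a_se = association_rate([143.0] * 45, tbp_molar=226e-9)
print(f"tau = {tau:.0f} s, k_a = {k_a:.2e} +/- {k_a_se:.1e} M^-1 s^-1")
```

With $\tau = 143$ s, n = 45, and the assumed 226 nM TBP, this reproduces a rate constant close to $3.1 \times 10^{4}\,\mathrm{M^{-1}s^{-1}}$ with a standard error of about $0.5 \times 10^{4}\,\mathrm{M^{-1}s^{-1}}$.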



Dissociation kinetics 



Characteristic times for the single-exponential decay of the TBP-AdMLP complex have been
reported in the range from 10 to 170 min (14, 3, 17, 2^ i^). The existence of an additional,
fast phase in the dissociation, nearly two orders of magnitude faster than the dominant slow
phase, was revealed only by detailed kinetic studies (17).

We did not determine the kinetics of spontaneous dissociation in our experiments. An 
experiment was typically ended after 15-20 min by exchanging the assay-buffer with 1 M KCl,
and then looking for dissociation events. We did this to confirm that any observed change in 
Brownian RMSD was caused by TBP-DNA interactions. Furthermore, tethered beads had 
a tendency to get stuck on the coverslip after extended observation, thus terminating any 
further observation. 



Intermediate States 



To further investigate the binding kinetics, we modified the experimental setup. By using
polystyrene beads of a smaller diameter (0.46 µm instead of 1.0 µm) as well as separately
analyzing each of the two video-fields that constitute a video-frame, the temporal resolution
was increased. The TBP concentration was 50-100 nM, i.e., lower than in the experiments
described above. At these protein concentrations approximately 50% of the tethers showed
binding events. The expected equilibrium dissociation constant for the TBP-DNA interaction
is 16 nM, with the buffer and temperature conditions used here (17). However, due to the
previously mentioned effects of surfaces, freeze/thaw cycles, and tension in the DNA, the
observed single-molecule dissociation constant is expected to be higher. During 10 out of
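11 observed binding events, the Brownian RMSD was observed to decrease and sometimes
increase in a stepwise manner in the presence of TBP, see Fig. 3. The kinetic scheme
suggested in (17) (boxes added by us):

$$
\underset{\mathcal{H}}{\boxed{\mathrm{DNA} + \mathrm{TBP}}}
\;\underset{k_2}{\overset{k_1}{\rightleftharpoons}}\;
\underset{\mathcal{M}}{\boxed{I_1}}
\;\underset{k_4}{\overset{k_3}{\rightleftharpoons}}\;
\underset{\mathcal{L}}{\boxed{I_2 \;\underset{k_6}{\overset{k_5}{\rightleftharpoons}}\; \mathrm{DNA{:}TBP}_{\mathrm{final}}}}
$$

predicts the existence of two intermediate states $I_1$ and $I_2$, on the path to the final, bound
state. The intermediates and the final complex were all postulated to have the same bend
in the DNA and to differ only in their stability (^3).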

In an experiment where only the first and the last states are resolved the forward reaction
can be described by a second-order association rate constant $k_a$ that depends on the microscopic
rate constants $k_i$, $i = 1 \ldots 6$ (0). This rate-constant $k_a$ is the one cited above, because
we were not able to discern intermediates using the silica beads (chosen for their high optical
trapping efficiency, see Appendix A.3). In the following we distinguish between the observed
classes of Brownian motion, and their interpretation in terms of different underlying states
of the TBP-DNA complex.



Three classes of Brownian motion were observed in the histograms 

Figure 3 shows three examples of the time-development of $B$ as well as histograms of $B$.
Each histogram shows a distribution of the Brownian RMSD with three separate peaks.
Based on such histograms we divided the Brownian motion into three classes. Each of these
three classes must correspond to a different conformation of the TBP-DNA complex: i) The
$\mathcal{H}$-class is defined from the Brownian RMSD observed before addition of TBP. Thus, the
$\mathcal{H}$-class corresponds to DNA that is not bound by TBP, but includes also the transient
encounter complex that is implicit in all bimolecular reactions. ii) The $\mathcal{L}$-class is defined
from the Brownian RMSD observed several minutes after addition of TBP. We identify this
class with a superposition of the final bound state and the second intermediate state $I_2$ (see
discussion below). iii) The $\mathcal{M}$-class was a surprise. It is defined from the Brownian RMSD
that was of a magnitude midway between the $\mathcal{H}$ and $\mathcal{L}$ classes. We interpret this class as
corresponding to the first intermediate state $I_1$.

Each of these three classes is indicated by a box in the kinetic scheme shown above. The
observed Brownian RMSD in each class was: 71 ± 7 nm ($\mathcal{H}$-class), 56 ± 7 nm ($\mathcal{M}$-class),
and 46 ± 10 nm ($\mathcal{L}$-class) respectively (means ± SDs, n = 10 individual tethers). The three
classes differ from each other with statistical significance: $\mathcal{H} = \mathcal{M}$ with P = 8.3 x 10~^,
$\mathcal{M} = \mathcal{L}$ with P = 6.3 x 10~^, and $\mathcal{H} = \mathcal{L}$ with P = 1.6 x 10~^ using right-tailed t-tests.



A fourth class of Brownian motion was observed in the time-series 

The histograms do not show a second intermediate with a Brownian RMSD different from
that of the three classes already mentioned. Assuming the existence of a second intermediate
state $I_2$, this lack of its observation suggests that the $I_2$ state is structurally similar to the
final-bound state and for that reason does not show up as a separate peak in the histograms
of $B$. However, there is valuable extra information in the time-series of $B$ compared to the
histograms of $B$: The temporal development of $B$ in Fig. 3 shows an initial period of multiple
transitions to and from the lower class $\mathcal{L}$ (e.g., panel A, $t \in [100 : 325]$), followed by a quiescent
period with no transitions (panel A, $t \in [325 : 650]$). That is, we observed two classes
of $\mathcal{L}$ with the same Brownian RMSD but with different stability. We interpret this as the
existence of two states of the TBP-DNA complex with equally bent DNA, but with different
stabilities. We identify the first period (fast dynamics) with the second intermediate state
$I_2$, and the second period (slow dynamics) with the final-bound state. The rate constants
determined in (17) indicate that the second intermediate should indeed be populated for only
very short periods of time (mean-occupancy-time $= (k_4 + k_5)^{-1} = 1.2$ s), whereas the final
state is populated for a much longer time ($1/k_6 = 282$ s). Without a second intermediate we
cannot explain the observed rapid transitions from the $\mathcal{L}$ class.
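The occupancy-time argument above rests on the fact that the mean dwell time in a state equals the inverse of the sum of its exit rates. A minimal stochastic (Gillespie-type) simulation of a linear four-state pathway illustrates this. Only $(k_4 + k_5)^{-1} = 1.2$ s, $1/k_6 = 282$ s, and the ensemble values of $k_1$ and $k_3$ are taken from the text; the TBP concentration of 75 nM, the value of $k_2$, the split between $k_4$ and $k_5$, and the reading of "30 ms$^{-1}$" as $30 \times 10^{-3}$ s$^{-1}$ are assumptions chosen for illustration.

```python
import numpy as np

# States of the linear pathway: 0 = free DNA, 1 = I1, 2 = I2, 3 = final complex.
def simulate_pathway(forward, backward, t_end, rng, start=0):
    """Gillespie-type simulation; returns a list of (state, dwell_time) visits."""
    state, t, visits = start, 0.0, []
    while t < t_end:
        k_fwd, k_back = forward[state], backward[state]
        k_out = k_fwd + k_back
        if k_out == 0.0:          # absorbing state: nothing more can happen
            break
        dwell = rng.exponential(1.0 / k_out)
        visits.append((state, dwell))
        t += dwell
        # Step forward with probability k_fwd / k_out, otherwise step back.
        state += 1 if rng.random() < k_fwd / k_out else -1
    return visits

def mean_dwell(visits, state):
    """Average dwell time over all visits to `state`."""
    return float(np.mean([d for s, d in visits if s == state]))

tbp = 75e-9                 # M, assumed (within the 50-100 nM range used)
k1 = 1.59e6 * tbp           # s^-1, pseudo-first-order, from the ensemble k1
k2 = 0.05                   # s^-1, hypothetical placeholder
k3 = 0.030                  # s^-1, assumed reading of the ensemble "30 ms^-1"
k5 = 0.25                   # s^-1, hypothetical split of k4 + k5
k4 = 1.0 / 1.2 - k5         # s^-1, so that (k4 + k5)^-1 = 1.2 s
k6 = 1.0 / 282.0            # s^-1, from 1/k6 = 282 s
forward = [k1, k3, k5, 0.0]
backward = [0.0, k2, k4, k6]

rng = np.random.default_rng(1)
visits = simulate_pathway(forward, backward, t_end=2.0e5, rng=rng)
print("mean dwell in I2:   ", mean_dwell(visits, 2), "s")
print("mean dwell in final:", mean_dwell(visits, 3), "s")
```

In such a simulation the $I_2$ state is visited often but only briefly, whereas visits to the final state are rare and long, which is the qualitative pattern (fast dynamics followed by a quiescent period) described for the $\mathcal{L}$ class.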



Comparison of rate constants to ensemble values 

From manual inspection of ten time-series of $B$ we estimated the two microscopic rate-constants
$k_1$ and $k_3$, and the macroscopic rate-constant $k_a$. From the time spent in the $\mathcal{H}$
class, after addition of TBP and until a transition to the $\mathcal{M}$ class, we found $k_1 = 1.6 \pm 0.5\,\mu\mathrm{M^{-1}s^{-1}}$
(mean ± SE, n = 10), in agreement with the ensemble value ($1.59\,\mu\mathrm{M^{-1}s^{-1}}$,
(17)). From the $\mathcal{M}$ class we measured transitions to the $\mathcal{L}$ class and found $k_3 = 54 \pm 16\,\mathrm{ms^{-1}}$
(mean ± SE, n = 11) again in reasonable agreement with the ensemble value (~$30\,\mathrm{ms^{-1}}$,
(17)). Finally, from the waiting time between addition of TBP and until reaching the final-bound
state we found $k_a = (9 \pm 3) \times 10^{4}\,\mathrm{M^{-1}s^{-1}}$ (mean ± SE, n = 8), in agreement with
reported ensemble values (17, 25) and with the value we found using silica beads. The
remaining microscopic rate-constants could not be determined with sufficient accuracy, due
to the finite time-resolution and sample size, to allow comparison with ensemble data.

What are the Structures of the Intermediates? 

Co-crystal structures of TBP bound to the AdMLP TATA-box show two sharp kinks in the
DNA (0, 11). The first kink occurs where two phenylalanine residues intercalate between the
first two base-pairs. The second kink occurs where another pair of phenylalanine residues is
inserted between the last two base-pairs (5'-T▼ATAAAA▼G-3', triangles indicate intercalations).
Both sets of intercalations produce DNA-kinks of ~45° (fzi, 11). Based on the crystal
structure, the most straightforward interpretation of our data assigns one pair of intercalations
to the $\mathcal{M}$-class and another pair to the $\mathcal{L}$-class. The high flexibility of the 5'-end of
the TATA-box implies that this is where the first pair of intercalations is most likely to take
place (jisl), whereas the more rigid 3'-end (A-tract) is expected to delay the second set of
intercalations. We therefore propose the following sequence of events: Step One, leading to
$I_1$, consists of the intercalation of one pair of phenylalanine residues in the upstream, 5'-end,
of the AdMLP. Step Two, leading to $I_2$, consists of the intercalation of the second pair of
phenylalanine residues in the downstream, 3'-end, of the AdMLP. Step Three, leading to
the final complex, has no major structural change, but consists of a slight rotation of one
TBP domain relative to the other as well as the formation of van der Waals' contacts
between the minor groove of the DNA and the concave surface of the TBP, leading to the
stable structure known from crystallography (0,0).

Assuming a three-step pathway there are two other possible assignments for the two
pairs of intercalation events, both of which have previously been proposed: The first set of
intercalations takes place in step Two and the second set in step Three (17). Alternatively,
the first set of intercalations takes place in step One and the second set in step Three (25).
Based on the data presented in this paper we favor the first model over the latter two.
However, we emphasize that it is not possible to make any definitive distinction between
these three models based on the existing data; all assignments of structures to states are
speculation, so far.

What are the Biological Implications of the Intermediates? 

From in vitro experiments a picture of the assembly of the transcription pre-initiation complex
(PIC) has emerged (44). In this picture, assembly takes place in steps: TBP binds to
DNA, followed by the binding of TFIIB, after which a preformed complex that includes the
polymerase is recruited. Later yet, additional factors bind to the complex. It is possible for
TFIIA to enter the PIC at any point after TBP. Now, assume that the $I_1$-state exists and
has a single kink in the upstream part of the TATA-box. This will allow TFIIA to bind
stably to the complex, since this factor makes contacts only with upstream DNA-sequences
and TBP (45, 46). On the other hand, TFIIB makes contacts with both upstream and
downstream DNA, as well as the TBP (47). In the one-kink $I_1$-state, geometry dictates that
proximal to the TBP, upstream and downstream DNA are further apart than in the two-kink
$I_2$-state. In the $I_1$-state therefore, the formation of a stable association between TFIIB and
DNA will presumably be suppressed. In the $I_2$-state both pairs of phenylalanine intercalations
are in place, the DNA is fully bent, and TFIIB can form the contacts known from
its crystal structure (47). Thus, we arrive at a picture in which assembly of the PIC can
proceed already from the $I_1$-state, and in which the structural conformation of $I_1$ suggests
an ordering of events with TFIIA binding before TFIIB, see Fig. [31. The kinetics of the
$I_1$-state may facilitate the correct orientation of DNA-bound TBP, as discussed in (17).



CONCLUSIONS 

We performed single-molecule experiments investigating the specific binding of a TATA-box Binding
Protein to DNA. In the experiments, beads were attached to a surface by short DNA-
tethers and underwent restricted Brownian motion. When bound by protein the DNA was 
bent and the Brownian RMSD of the tethered beads decreased. With this setup we measured 
kinetic parameters describing the binding of TBP to DNA. 

Intriguingly, changes in Brownian motion during a binding event revealed the existence of 
two intermediate states on the binding pathway. This constitutes the first direct experimental 
corroboration of a model suggested in (0), in which the kinetic pathway of the TBP-DNA 
interaction has two such intermediates. By direct observation of individual departures and 
their time of occurrence, we measured kinetic constants describing rates to and from the 
intermediate states. These rates were in agreement with the model-dependent rates derived 
in (17) and thus support the kinetic scheme given there. However, contrary to what is
speculated in (17, 25), we found that the DNA is less bent in the first intermediate than in
the final complex. This, in turn, might have implications for the order of the assembly of 
the transcription pre-initiation complex, favoring the association of TFIIA before TFIIB. 

The results presented here demonstrate that it is possible to make time-resolved observations
of single binding and dissociation events of TBP to promoter DNA. This shows that DNA-distortions
that are much less dramatic than e.g. the looping induced by the lac repressor
(jiih), can be reliably detected using well-established single-molecule techniques. Furthermore,
the work presented here opens the door to a number of studies of the system, such as 
quantitative measurements of the force and torque dependence of rate-constants describing 
the TBP-DNA interaction. 
A Appendix

A.1 Brownian Motion Measure

The time-series of (x, y)-positions for the DNA-tethered bead was broken into non-overlapping
segments. Each segment contained a distribution of positions in the (x, y)-plane for which
we calculated the tensor of inertia. In two dimensions, the tensor of inertia is a two-by-two
matrix

$$ I = \begin{pmatrix} I_{xx} & I_{xy} \\ I_{yx} & I_{yy} \end{pmatrix} . \qquad (1) $$

The principal moments are the entries of the diagonalized matrix, and we denote them $I_{\min}$
and $I_{\max}$ in order of increasing magnitude. They are:

$$ I_{\min} = \frac{1}{2}\left[ I_{xx} + I_{yy} - \sqrt{(I_{xx} - I_{yy})^2 + 4 I_{xy}^2} \right] , $$

$$ I_{\max} = \frac{1}{2}\left[ I_{xx} + I_{yy} + \sqrt{(I_{xx} - I_{yy})^2 + 4 I_{xy}^2} \right] , \qquad (2) $$

where $I_{xx} = \frac{1}{n-1}\sum_{i=1}^{n}(y_i - \bar{y})^2$, $I_{yy} = \frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})^2$,
$I_{xy} = I_{yx} = \frac{1}{n-1}\sum_{i=1}^{n}(x_i - \bar{x})(y_i - \bar{y})$,
$x_i$ and $y_i$ are the recorded positions of the tethered bead, and $\bar{x}$ and $\bar{y}$ are averages calculated
from the positions in each segment. The sum of the principal moments equals the sum of
the variances along the x and y axes: $I_{\min} + I_{\max} = \sigma_x^2 + \sigma_y^2$. As a measure for the amplitude
of the Brownian motion of a tethered bead we used $B = \sqrt{(I_{\min} + I_{\max})/2}$, and as a measure
for the isotropy of the bead-motion we used the ratio $r = \sqrt{I_{\min}/I_{\max}}$. Thus, B is the RMSD
for the positions visited by a tethered bead. The isotropy-measure was approximately 50-
100% for a bead with only a single DNA tether. If r was consistently smaller than ~50%
we interpreted this as a bead tethered by two DNA molecules, a polystyrene link, or some
other, non-specific interaction, and discarded the data. An example of non-isotropic motion
is shown in Fig. 6.
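As an illustration only, the amplitude measure B and the isotropy measure r described in this section can be computed from recorded bead positions along the lines of the following Python sketch. The function names and the windowing helper are choices made for this sketch and are not taken from the original analysis software; the off-diagonal element is taken with a positive sign, which does not affect the principal moments.

```python
import math


def brownian_measures(xs, ys):
    """Return (B, r) for one segment of bead positions.

    B = sqrt((I_min + I_max) / 2) is the RMSD-type amplitude,
    r = sqrt(I_min / I_max) is the isotropy measure (1 = isotropic).
    """
    n = len(xs)
    if n < 2 or n != len(ys):
        raise ValueError("need two equally long lists with at least 2 points")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    # Elements of the two-dimensional tensor of inertia (sample normalization).
    i_xx = sum((y - mean_y) ** 2 for y in ys) / (n - 1)
    i_yy = sum((x - mean_x) ** 2 for x in xs) / (n - 1)
    i_xy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / (n - 1)
    # Principal moments = eigenvalues of the symmetric 2x2 matrix.
    root = math.sqrt((i_xx - i_yy) ** 2 + 4.0 * i_xy ** 2)
    i_min = max(0.5 * (i_xx + i_yy - root), 0.0)  # guard against rounding
    i_max = 0.5 * (i_xx + i_yy + root)
    amplitude = math.sqrt((i_min + i_max) / 2.0)
    isotropy = math.sqrt(i_min / i_max) if i_max > 0 else float("nan")
    return amplitude, isotropy


def segment_measures(xs, ys, window):
    """Apply brownian_measures to consecutive non-overlapping windows."""
    return [
        brownian_measures(xs[s:s + window], ys[s:s + window])
        for s in range(0, len(xs) - window + 1, window)
    ]
```

With video-rate data, a window of 50 frames corresponds to the 2 s windows quoted in the figure legends; segments with r consistently below about 0.5 would then be flagged for exclusion.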



A.2 Multiple Tethers

We varied the DNA concentration during sample preparation and observed that the fraction 
of beads with multiple tethers increased as a function of DNA concentration. Inspection of 
scatter plots of bead positions revealed several classes of motion: At low DNA concentrations,
the vast majority of scatter plots were isotropic; as the DNA concentration was increased, some
of the scatter plots were observed to be anisotropic; see Fig. 6, left panel. At even higher
DNA concentrations, approximately isotropic plots with a small radius began to appear along
with the previously described shapes. These observations are consistent with the formation
of one, two, or more tethers per bead: One tether gives rise to an isotropic scatter plot, two
tethers yield an elongated, less isotropic scatter plot, and three or more tethers result in
roughly isotropic scatter plots, but with a small radius. Multiple tethers are easily
detected because the DNA tether is short, just two persistence lengths.

An example is shown in Fig. 6. This scatter plot of bead positions indicates that two
DNA molecules tethered the bead. Addition of restriction enzyme led to a step-wise change
in Brownian motion, which we interpret as follows: Initially the bead was tethered by two DNA
molecules. The presence of two tethers broke the rotational symmetry of the setup and
forced the bead to move in a quasi one-dimensional fashion. After 120 s an enzyme cut one
of the DNA tethers and rotational symmetry was restored. After an additional 40 s the
other DNA tether was also cut, and the bead diffused away. See movie in Supplementary
Information.

A.3 Laser Tweezers Stretching Experiments

Laser tweezers were used to stretch DNA tethers with TBP attached, thereby forcing TBP
off the DNA tether. For this experiment, flow cells were prepared as described in 'Methods',
with streptavidin-coated silica beads attached to TATA-DNA tethers. The protocol included
the following steps: i) TBP was flowed in and allowed to bind to the DNA; ii) unbound TBP
was removed by flowing through 3 × 100 μl Buffer 1; iii) tension was applied to the DNA
tether. The Brownian motion of the tether was measured before and after each of these
steps. Tension was applied by moving the laser back and forth over the tethered bead six
times: The peak-to-peak distance of the motion was 5 μm and the speed was kept constant
at 0.1 μm/s, i.e., this procedure lasted 5 min. The maximum force exerted on the silica bead
by the laser tweezers was estimated to be 38 ± 4 pN by an escape-method calibration (jiol ).
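The duration of the stretching procedure follows from simple arithmetic, sketched below in Python. The sketch assumes that "six times" means six one-way passes of 5 μm each and that the sweep speed is 0.1 μm/s, the value consistent with the stated 5 min; the function name is hypothetical.

```python
def sweep_duration_s(n_passes, peak_to_peak_um, speed_um_per_s):
    """Total time for n one-way passes at constant speed (assumed protocol)."""
    return n_passes * peak_to_peak_um / speed_um_per_s


# Assumed values: six passes, 5 um peak-to-peak, 0.1 um/s.
duration_s = sweep_duration_s(6, 5.0, 0.1)
duration_min = duration_s / 60.0
```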

Figure 7 shows how the Brownian motion changed during the laser tweezers experiment.
Before TBP was flowed in (at t = 0 s), B had the value expected for a normal, unbent
tether. After TBP was flowed in, it bound to the DNA. Unbound TBP was washed out 20 min
after it was flowed in, the Brownian motion was measured, and the laser tweezers were applied
for the 5 min stretching procedure. After application of the laser tweezers, B returned to the
value of an unbent DNA tether. We interpret this change in B as the dissociation of TBP
from DNA. Due to irreversible sticking of the bead to the coverslip, the Brownian motion of
the tethered bead was determined in only four cases after application of the laser tweezers.
Results similar to those shown in Fig. 7 were found in all cases.
A.4 Weak Entropic Forces Tense the DNA-Tether

In tethered-particle-motion experiments no external forces are applied. However, the
configuration of the system gives rise to a weak entropic force that tends to stretch the DNA tether:
If no tether were present, the bead would diffuse away (see movie in Supplementary
Information). Since it does not, a force must be acting on the bead. The tether mediates this
force and hence is under tension. To find the tension in the DNA we first write down the
partition function Z for the system, i.e., the sum over all possible configurations in which we can
find our system (ignoring all gravitational, inertial, electrostatic, and hydrodynamic effects):

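$$ Z(\ell, R) = \int_0^{\ell} g(l)\, dl \int_0^{\pi/2} l\, d\theta \int_0^{2\pi} l \sin\theta\, d\varphi \int_0^{\alpha_{\max}} R\, d\alpha \int_0^{2\pi} R \sin\alpha\, d\beta , \qquad (3) $$

where $\alpha_{\max} = \cos^{-1}\left(1 - \frac{l}{R}\cos\theta\right)$, $g(l) \propto e^{-E(l)/k_B T}$ is the Boltzmann weight-factor, and $E(l)$ is
the energy associated with the tether extension (see Fig. 8). Integrating over $\alpha$, $\beta$, $\theta$, and $\varphi$
we are left with:

$$ Z(\ell, R) = 2\pi^2 R \int_0^{\ell} l^3 g(l)\, dl . \qquad (4) $$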
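The angular integration can be checked numerically. The Python sketch below assumes that the upper limit on α is set by the requirement that the bead center stays at least one radius above the surface, i.e. cos α_max = 1 − (l/R) cos θ; under that assumption the four angular integrals of sin θ sin α give 2π² l/R, which together with the Jacobian factor l² R² yields an integrand proportional to R l³. The function names and the example values of l and R are illustrative only.

```python
import math


def angular_integral(l, R, n=400):
    """Midpoint-rule estimate of the integral of sin(theta) * sin(alpha)
    over theta in [0, pi/2], phi in [0, 2*pi], alpha in [0, alpha_max(theta)],
    and beta in [0, 2*pi], with cos(alpha_max) = 1 - (l / R) * cos(theta)."""
    if l > 2.0 * R:
        raise ValueError("sketch assumes l <= 2R so that alpha_max is defined")
    d_theta = (math.pi / 2.0) / n
    total = 0.0
    for i in range(n):
        theta = (i + 0.5) * d_theta
        alpha_max = math.acos(1.0 - (l / R) * math.cos(theta))
        d_alpha = alpha_max / n
        inner = sum(math.sin((j + 0.5) * d_alpha) for j in range(n)) * d_alpha
        total += math.sin(theta) * inner * d_theta
    # phi and beta each contribute a factor 2*pi.
    return (2.0 * math.pi) ** 2 * total


def closed_form(l, R):
    """Analytic value of the same integral under the stated assumption."""
    return 2.0 * math.pi ** 2 * l / R


numeric = angular_integral(94.0, 230.0)   # nm; illustrative tether and bead radius
analytic = closed_form(94.0, 230.0)
```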

In the Micro-Canonical Ensemble, the particle number and energy of a system are fixed; in
the Canonical Ensemble only the particle number is fixed; in the Grand-Canonical Ensemble
both the particle number and energy can vary. Our system consists of one DNA molecule
and one bead, and this number is fixed. However, the system is in thermal contact with the
buffer, so its energy can vary. Thus, our system is described by the Canonical Ensemble and
the free energy H of our system is

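$$ H = -k_B T \ln Z , \qquad (5) $$

where $k_B$ is Boltzmann's constant and T is the absolute temperature of the buffer solution.
From this expression we directly find the tension $F_{\mathrm{DNA}}$ by differentiating with respect to $\ell$:

$$ F_{\mathrm{DNA}} = -\frac{\partial H}{\partial \ell} = \frac{k_B T}{Z} \frac{\partial Z}{\partial \ell} . \qquad (6) $$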

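If the tether is a stiff rod of length $\ell$, $g(l) \propto \delta(l - \ell)$, where $\delta$ is Dirac's delta function. In
this case $Z \propto \ell^3$ and the tension is given by the simple expression

$$ F_{\mathrm{DNA}} = \frac{3 k_B T}{\ell} . \qquad (7) $$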
Because work is done against a force when DNA is bent by TBP, the rate of association
is expected to be reduced. Making the approximation that the association rate-constant
decreases as

$$ k = k_0\, e^{-F_{\mathrm{DNA}} \Delta\ell / k_B T} , \qquad (8) $$

where $k_0$ is the rate measured in bulk with no external forces applied, we estimate that
the association rate should be reduced by 25-50% due to tension in the DNA-tether (jH3):
In bulk, the end-to-end distance of ~300 bp dsDNA is approximately 15% shorter than its
contour length ()5lf ). Thus, when no external force is applied, the end-to-end distance of a
324 bp DNA is expected to be 94 nm, assuming a 0.34 nm axial rise per bp (|5^. If we model
the DNA as a stiff rod of length $\ell$ = 94 nm, an 80° kink in the middle of the rod decreases
the end-to-end distance of the rod by $\Delta\ell$ = 22 nm. The actual change in end-to-end distance
is likely to be somewhat smaller due to the flexibility of the DNA and the presence of the
force $F_{\mathrm{DNA}}$. A lower limit of $\Delta\ell$ > 7.3 nm is set by FRET experiments.
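The numbers in this estimate can be retraced with a short Python sketch. Everything here is illustrative: the function names are hypothetical, the temperature of 22.1 °C is taken from the room-temperature value quoted in the figure legends, and the stiff-rod tension is assumed to have the form F = 3 k_B T / ℓ (the prefactor is exposed as a parameter because it is an assumption of this sketch).

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K


def thermal_energy_pn_nm(temp_c):
    """k_B * T expressed in pN*nm (1 J = 1e21 pN*nm)."""
    return K_B * (temp_c + 273.15) * 1e21


def end_to_end_nm(n_bp, rise_nm=0.34, shortening=0.15):
    """Bulk end-to-end distance: contour length reduced by a fixed fraction."""
    return n_bp * rise_nm * (1.0 - shortening)


def kink_shortening_nm(rod_nm, kink_deg):
    """Loss of end-to-end distance when a stiff rod is kinked at its midpoint."""
    return rod_nm * (1.0 - math.cos(math.radians(kink_deg) / 2.0))


def stiff_rod_tension_pn(rod_nm, temp_c, prefactor=3.0):
    """Entropic tension, assuming F = prefactor * k_B T / rod length."""
    return prefactor * thermal_energy_pn_nm(temp_c) / rod_nm


def rate_ratio(force_pn, delta_nm, temp_c):
    """k / k_0 = exp(-F * delta / k_B T)."""
    return math.exp(-force_pn * delta_nm / thermal_energy_pn_nm(temp_c))


rod = round(end_to_end_nm(324))              # end-to-end distance in nm
delta_max = kink_shortening_nm(rod, 80.0)    # upper estimate of the shortening
tension = stiff_rod_tension_pn(rod, 22.1)
reduction_upper = 1.0 - rate_ratio(tension, delta_max, 22.1)
reduction_lower = 1.0 - rate_ratio(tension, 7.3, 22.1)
```

The two reductions bracket the effect for shortenings between the FRET lower limit and the stiff-rod upper estimate, and can be compared with the 25-50% range quoted in the text.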
SUPPLEMENTARY INFORMATION

An online supplement to this article can be found by visiting BJ Online at http://www.biophysj.org.
FIGURE LEGENDS

Figure 1

Scale-drawing of 0.46 μm bead tethered to surface by 324 bp of DNA, side view. The full-line
circles illustrate the extremal positions the bead can take when the DNA is straight. The
dashed-line circles show the extremal positions the bead can take when the DNA is modeled
as two stiff rods at right angles to each other. The difference between the positions of the
center of the bead at the two extrema is nearly 100 nm. The Brownian RMSD is a measure
for the variance in the bead's (x, y)-positions. Thus, the change in Brownian RMSD upon
bending of the DNA will be less than the change in extremal positions. 

Figure 2

Example of TBP binding to and dissociating from TATA-DNA. Main panel: Time-series
of the Brownian RMSD of a DNA-tethered 1.0 μm silica bead. Discontinuities of the time-
series, due to buffer exchange, are indicated by white spaces on the time-axis. Line: Brownian
RMSD calculated in non-overlapping 2 s (50 video-frames) windows. 30 μl of 226 nM TBP was
flowed through at t = 0 (first arrow); the DNA-tether was bound by TBP after t = 210 s;
30 μl of 1 M KCl was flowed through at t = 760 s; 300 μl of Buffer 1 was flowed through at t = 850 s,
around which time the dissociation took place (second arrow). Inset: Positions visited by
the tethered bead before (grey, t ∈ [112 : 210] s) and after (black, t ∈ [210 : 270] s) TBP
bound to the DNA. The time of binding was determined from the time-series of B (main
panel).

Figure 3

Distribution of waiting times between addition of TBP and observation of a binding event. 
Abscissa: time in seconds. Ordinate: number of observed events. A total of 48 binding 
events were observed in 29 individual experiments under identical conditions of 226 nM 
TBP in Buffer 1 at room-temperature (22.1 ±1.5 °C, mean ± SD; interval [20 : 25] °C), using 
TATA-DNA tethered silica beads. The dynamics of the spatial distribution of TBP in the 
flow cell was modeled as a diffusive process with reflecting boundary conditions: Setting the 
diffusion coefficient of TBP to 50 μm²/s, the height of the flow cell to 160 μm, and the initial
distribution of TBP to a delta-function, the distribution of TBP in the flow cell was found to
be homogeneous after < 60 s (indicated by vertical dashed line). We proceeded by excluding
from further analysis all events in the first 60 s (n = 3). A maximum likelihood, single-
exponential fit returned a characteristic time of τ = 143 ± 22 s (mean ± SE). White circles
show expected counts in each bin, assuming an exponential distribution. Error-bars shown
are expected standard deviations, calculated assuming a binomial distribution of counts in
each bin. The maximum likelihood fit does not depend on the bin-width, because the fit was 
done directly to the observed waiting times. 
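A maximum likelihood fit of this kind can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the original analysis: it assumes that events before the 60 s cut-off are dropped and that the remaining waiting times are measured from the cut-off (valid for a memoryless exponential process), it uses the asymptotic standard error τ/√n, and the function names and the waiting times in the usage example are hypothetical.

```python
import math


def fit_exponential(times_s, t_min_s=60.0):
    """Maximum-likelihood characteristic time for exponentially distributed
    waiting times, using only events later than t_min_s.

    Returns (tau, standard_error, n_events). For an exponential distribution
    the ML estimate of tau is the sample mean of the shifted waiting times.
    """
    shifted = [t - t_min_s for t in times_s if t > t_min_s]
    n = len(shifted)
    if n == 0:
        raise ValueError("no events after the cut-off")
    tau = sum(shifted) / n
    return tau, tau / math.sqrt(n), n


def expected_bin_counts(tau, n, edges_s, t_min_s=60.0):
    """Expected count and binomial standard deviation for each histogram bin.

    edges_s are bin edges in seconds, all >= t_min_s.
    """
    result = []
    for lo, hi in zip(edges_s[:-1], edges_s[1:]):
        p = math.exp(-(lo - t_min_s) / tau) - math.exp(-(hi - t_min_s) / tau)
        result.append((n * p, math.sqrt(n * p * (1.0 - p))))
    return result


# Hypothetical waiting times in seconds, for illustration only.
example_times = [30.0, 50.0, 160.0, 260.0, 360.0]
tau, se, n = fit_exponential(example_times)
bins = expected_bin_counts(tau, n, [60.0, 260.0, 460.0])
```

Because the estimate is formed directly from the individual waiting times, it does not depend on how the histogram is binned; the bins enter only when expected counts are drawn for comparison.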

Figure 4

Direct observation of sub-steps on the TBP-DNA binding-pathway. Panels A1, B1, and C1
show 3 examples (n = 10 observed) of the temporal development of the Brownian RMSD
B calculated in non-overlapping 1 s windows for TATA-DNA tethered 0.46 μm polystyrene
beads. 100 μl TBP was flowed through at time t = 0 (indicated by arrows). Panels A2,
B2, and C2 show histograms of B formed from the data shown in panels A1, B1, and C1,
respectively. Three peaks are present in each of the histograms, corresponding to three
classes of Brownian motion. Horizontal dashed lines in the time-series-panels indicate the
positions of the peaks in the histograms. Multiple back and forth transitions between the
different classes of Brownian motion can be seen in all three examples. A-panels: 100 nM
TBP, histogram shows B in the interval t ∈ [50 : 400] s. B-panels: 68 nM TBP, histogram
shows B in the interval t ∈ [0 : 100] s. C-panels: 68 nM TBP, histogram shows B in the
interval t ∈ [0 : 290] s.

Figure 5

Suggested step-wise order of events for the binding of TBP, TFIIA, and TFIIB to DNA.
Starting in the upper left corner, TBP binds to the TATA-box and the first pair of
phenylalanines are intercalated in the 5' end of the TATA-box, producing a 45° kink. This
corresponds to the I1 state. Starting from this state TFIIA can bind. Next, the second pair
of phenylalanines are intercalated, in the 3' end of the TATA-box, producing another 45°
kink. This conformation corresponds to the I2 and final-bound state. In this conformation
the DNA is brought close enough together that TFIIB can bind to it.

Figure 6

Time series of Brownian motion for a 1 μm diameter silica bead tethered by control-DNA, in
the presence of XhoI. Left panel: Scatter-plot showing principal axes. Right panel: The
square-roots of the two principal moments, $\sqrt{I_{\max}}$ and $\sqrt{I_{\min}}$, are shown in grey and black,
respectively. During the first 120 s the motion of the bead was highly anisotropic (isotropy
measure r = 39%). After 120 s, the motion of the bead was isotropic (r = 94%); the bead
released from the surface after 160 seconds.
Figure 7

Time-evolution of Brownian RMSD (calculated in non-overlapping 2 s windows) during laser
tweezers experiment. The stepwise binding of TBP to DNA started approximately 80 s after
30 μl of 226 nM TBP was flowed in. Approximately 20 min after addition and binding of TBP,
the tweezers were used to pull the silica bead horizontally, thus stretching the tether. After
this stretching, B returned to its original value, suggesting that the protein was forced off
the DNA.

Figure 8

A point P on the surface of a rigid sphere of radius R is attached to a point O on a flat
surface by a tether of contour length L. The only constraints on our system are that the
sphere stays in the upper half-plane and that the distance |OP| = l < L. θ is the angle
between OP and the surface normal, and α is the angle between PC and the surface normal,
where C is the center of the sphere.
Two more angles are needed to fully determine the configuration of the system: φ describes
the rotation of OP around the surface normal through O, and β describes the rotation of
PC around the surface normal through P. Rotation of the sphere around PC contributes
an additional degree of freedom, but this degree of freedom is independent of the other
parameters, and does not change in our experiment.